Mechanistic details of CRISPR-associated transposon recruitment and integration revealed by cryo-EM

Significance CRISPR-associated transposons (CASTs) show tremendous promise for genome engineering yet remain poorly understood. Here, we present the cryo-electron microscopy structure of the transposase (TnsB) from the V-K CAST element from Scytonema hofmanni (ShCAST). We determine the molecular mechanism of TnsB recruitment to the target site (via the AAA+ regulator TnsC) and the structural details of the TnsB transposase. This TnsB structure reveals architectural similarities to MuA, but also key structural differences that are significant for understanding CAST transposition. Importantly, we highlight a base-flipping mechanism for stabilizing the 5′ end of the transposon, potentially to ensure the fidelity of synaptic complex assembly. The structures presented here provide a direct target for rational, structure-guided design strategies and re-engineering of CAST elements.

Significance CRISPR-associated transposons (CASTs) show tremendous promise for genome engineering yet remain poorly understood. Here, we present the cryo-electron microscopy structure of the transposase (TnsB) from the V-K CAST element from Scytonema hofmanni (ShCAST). We determine the molecular mechanism of TnsB recruitment to the target site (via the AAA+ regulator TnsC) and the structural details of the TnsB transposase. This TnsB structure reveals architectural similarities to MuA, but also key structural differences that are significant for understanding CAST transposition. Importantly, we highlight a baseflipping mechanism for stabilizing the 5 0 end of the transposon, potentially to ensure the fidelity of synaptic complex assembly. The structures presented here provide a direct target for rational, structure-guided design strategies and re-engineering of CAST elements.
directly forms a simple insertion (16) based on the heteromeric TnsA+TnsB transposase (17). TnsA and TnsB form a protein complex for which the nuclease activities of both proteins (TnsA and TnsB) are required to generate simple insertions (17)(18)(19)(20), but the regulatory details of this process remain unresolved with Tn7 and related elements. A structure of the TnsB transposase would set the foundation for understanding the similarities that link related Tn7 and CAST elements, as well as the key differences that would explain their distinct behavior.

Results
TnsB and MuA Have Similar Architecture in the Context of the Strand-Transfer Complex. ShCAST transposition likely follows that of many other transposition systems: Pairing of the transposon ends (Fig. 1A, Left) is followed by nucleophilic attack at the transposon ends that allows them to be joined to target DNA (Fig. 1A, Middle), resulting in the product DNA, referred to here as the strand-transfer DNA (Fig. 1A Fig. S1) with which we obtained a high-resolution cryo-electron microscopy (cryo-EM) reconstruction (3.7-Å global resolution; Fig. 1D and SI Appendix, Fig. S2). Rigid-body docking of isolated domains obtained from an AlphaFold prediction (21) resulted in a nearly full-length atomic model spanning the majority of the TnsB sequence (GenBank accession no. WP_084763316.1; Fig. 1C). TnsB forms a C2symmetric tetrameric assembly organized around the strandtransfer DNA (Fig. 1D and E and Movie S1). The overall architecture and arrangement of functional domains are remarkably similar to the MuA STC (22) (Fig. 1F and G and SI Appendix, Fig. S3). MuA is a well-studied RNaseH transposase that is responsible for bacteriophage Mu integration. In the representative view shown ( Fig. 1F and G), both complexes resemble an "X," where the upper half of the complex consists of the target DNA (blue, Fig. 1F and G) and the lower half consists of the transposon ends (green, Fig. 1F and G). Both MuA and TnsB cleave the donor DNA in trans-the subunit whose DNAbinding domain interacts with DNA on the right-hand side of the complex (tan subunit, Fig. 1D and E) positions the catalytic domain to interact with the DNA on the left side of the target-donor junction and vice versa ( Fig. 1D and E). Furthermore, both left and right halves of the complexes are identical, with each half containing two protein chains, each in different conformations that are determined by where they bind on the DNA substrate. The two TnsB binding sites on the strand-transfer DNA substrate are referred to as L1 and L2 (because the designed DNA substrate used ShCAST left ends). The corresponding TnsB conformers are distinguished by which TnsB binding site they occupy (Movie S1), and hence the TnsB monomer bound to L1 is referred to as B-L1, and TnsB bound to L2 is referred to as B-L2 ( Fig. 1C and D; both are described in more detail below).
We have assigned TnsB domain names following MuA domain names (Fig. 1C) (22), given the remarkable similarities between the STC structures, in order to facilitate structural comparisons. Domains Iβ, Iγ, and IIβ are DNA-binding domains ( Fig. 1C and Movie S1), domain IIα is the catalytic domain, and, finally, domains IIIα and IIIβ span the TnsB C terminus, which will be discussed in detail in the following sections. The B-L1 conformation includes residues 29 to 474 and is positioned at the target-donor junction (tan and light purple, Fig. 1C and D and Movie S1). The second distinct conformation, B-L2, spans residues 196 to 519 (orange and dark purple, Fig. 1C and D) and binds the second TnsB binding site (L2).  Fig. 1G), possibly due to the choice of substrate (our DNA contains an incomplete L2 TnsB binding site). We also observe structural differences between ShCAST TnsB and MuA assemblies. One example is how the tetrameric architecture is stabilized, most notably in the placement of helix IIIα (red asterisk, Fig. 1F and G). In MuA, helix IIIα adopts two different configurations in the R1-(purple square, Fig. 1G) and R2-bound MuA subunits (yellow triangle, Fig. 1G). In contrast, in the TnsB STC, helix IIIα appears to stabilize the tetramer by making different intersubunit interactions (Movie S1). B-L2 helix IIIα (red asterisk, Fig. 1F) wraps around the back of domain IIα of B-L1 (light purple, Fig. 1E) to nestle between the B-L1 (light purple and tan) subunits, forming interactions with both ( Fig. 1D and H). In addition to helix IIIα, we observe intersubunit interactions between domain Iβ in B-L1 (tan, boxed in Fig. 1I) and domain IIβ in B-L2 (orange, Fig. 1I). Here, B-L1 domain Iβ completes a β-sheet within B-L2 domain IIβ (Fig. 1I). Therefore, while the TnsB STC contains many conserved features to ensure fidelity of synaptic complex assembly, it appears to have evolved different protein-protein interactions to hold the tetrameric assembly together compared with those found in the MuA STC.

ShCAST Transposase Recruitment Occurs via Interactions
between TnsC and TnsB's C Terminus. We do not observe any ordered structure past domain IIIα in our TnsB STC structure (Fig. 1C), consistent with the disorder prediction in this region ( Fig. 2A). Nevertheless, this is particularly interesting given the role of the transposase C terminus in both prototypic Tn7 and Mu. The last 22 residues of TnsB (residues 681 to 702) in prototypic Tn7 are essential for the TnsB-TnsC interaction and transposition (23). For Mu, the C terminus of MuA is crucial for stimulating adenosine triphosphate (ATP) hydrolysis (24) and disassembly of MuB filaments (the AAA+ protein providing a function analogous to ShCAST TnsC) (25,26), implying that MuA C-terminal interactions with MuB are also relevant for MuA transposition. Motivated by the remarkable structural and functional similarities between MuB and ShCAST TnsC (12), we reasoned that the C-terminal 109 residues of TnsB (spanning domains IIIα and IIIβ, which we refer to as TnsB CTD ; Fig. 2A) are most likely to interact with the TnsC filament. Because TnsB, like MuA, stimulates TnsC filament disassembly in a nucleotide-dependent manner ( Fig. 2B) (12), we reasoned that full-length TnsB would not form a stable complex with TnsC filaments suitable for high-resolution structure determination, so we instead pursued structural characterization with TnsB fragments. In order to capture a homogeneous "recruitment-like" state, we added TnsB CTD in excess to AMPPNP-bound TnsC, which forms continuous helical filaments on target DNA (12).
The cryo-EM reconstruction of the TnsC filament coated with TnsB CTD revealed side-chain density features (3.5-Å resolution) corresponding to 14 residues decorating the surface of TnsC filaments ( Fig. 2C and SI Appendix, Fig. S4). Atomic modeling into this density (SI Appendix, Fig. S5) indicated that this portion of TnsB most likely corresponds to the last 15 residues of TnsB (the last residue is not modeled), or residue positions 570 to 584, which we refer to as TnsB Hook ( Fig. 2A and D). Subsequent cryo-EM reconstruction of the TnsB Hook peptide (residues 570 to 584; Fig. 2A) in the presence of TnsC filaments resulted in a reconstruction indistinguishable from the previous one, except for slight resolution differences (3.5 vs. 3.8 Å for the TnsB CTD vs. TnsB Hook reconstructions, respectively; SI Appendix, Fig. S6), confirming the TnsB Hook sequence register assignment. The lack of additional density corresponding to TnsB CTD in our cryo-EM reconstruction suggests positions outside the structured TnsB Hook do not make specific contacts with the TnsC filament, which is consistent with TnsB disorder predictions ( Fig. 2A). Taken together, the most parsimonious explanation for this is that the TnsB Hook represents a structured interaction with TnsC connected by a flexible linker to the rest of the full-length TnsB. Deletions of either the TnsB CTD (ΔCTD, or equivalently TnsB ΔCTD , corresponding to residues 1 to 475; Fig. 2A) or the TnsB Hook (ΔHook, or TnsB ΔHook , corresponding to residues 1 to 569; Fig. 2A) result in loss of transposition activity (Fig. 2E).
ShCAST target-site selection relies on the stimulation of TnsC filament disassembly by TnsB-promoted ATP hydrolysis to allow guide RNA-directed transposition ( Fig. 2B) (12). Therefore, we wondered whether interactions with the TnsC filament were sufficient for hydrolyzable nucleotide-dependent filament disassembly, as observed in Mu (24,27). However, none of the TnsB N-terminal (TnsB ΔHook : 1 to 569; TnsB ΔCTD : 1 to 475) or C-terminal fragments (TnsB Hook : 570 to 584; TnsB CTD : 476 to 584) we assayed were sufficient to recapitulate the disassembly phenotype observed with full-length TnsB with ATP using EM imaging (Fig. 2F) or biochemical assays that track TnsC oligomerization on DNA (SI Appendix, Fig. S7), at least at concentrations for which full-length TnsB is effective at stimulating TnsC filament disassembly. Therefore, in contrast to Mu (24), this suggests that one or more additional interactions between TnsB and TnsC, in addition to that made with the TnsB Hook , are required in order to stimulate ATP hydrolysis and filament disassembly in ShCAST. Although a MuA-MuB structure is not available, the interaction surface between TnsB and TnsC appears to colocalize to the same interaction surface mapped to MuB, assuming positions between TnsC and MuB are roughly equivalent (SI Appendix, Fig. S8A) (27,28). Nevertheless, the lysine residues responsible for mediating transposase interactions in MuB do not appear conserved (SI Appendix, Fig. S8B), suggesting that the nature of interactions between the transposase and its AAA+ regulator varies across transposition systems.
Together, these results paint a picture of the initial steps of TnsB recruitment to the target site via the AAA+ regulator, TnsC. TnsB's C-terminal hook interacts with TnsC along the surface of the filament, but interaction via the TnsB Hook alone is insufficient to stimulate TnsC filament disassembly, indicating that one or more additional interactions between TnsB and TnsC not visualized here must be required. In addition, we is lost with C-terminal deletion constructs, TnsB ΔHook or TnsB ΔCTD , comparable to the case in which no transposase is added (indicated with "-"). Transposition activity is assessed via the number of kanamycin-resistant (KanR) colonies (see Materials and Methods for details). Data are represented by the mean; error bars indicate SD (n = 3, technical triplicates). Raw data points are shown in red. (F) Negative-stain EM images indicate that TnsC filaments are disassembled in a nucleotide-dependent manner by full-length TnsB (compare AMPPNP and ATP), but not by any of the TnsB fragments tested. (Scale bars, 500 Å.) reveal that the TnsB C-terminal hook is flexibly linked to the rest of TnsB. The flexible linker is not conserved in length or sequence among TnsB homologs from the V-K CAST elements (SI Appendix, Fig. S9). Nevertheless, given the relatively precise insertion spacing observed in ShCAST (4), it may play crucial roles in orienting TnsB to interact productively with the target site.
The TnsB Strand-Transfer Complex Stabilizes Highly Distorted DNA. DNA distortions, particularly in the target-bound DNA, are canonical features of RNaseH transposase structures. The TnsB STC has highly distorted DNA (120°bend; Fig. 3A and B) surrounding the 5-bp target site (brown, Fig. 3A) comparable to MuA (22). Target DNA distortions are required to place the scissile phosphate appropriately in the active site (29). Consistent with this, the DDE catalytic residues (D205, D287, and E321; Fig. 3C) are positioned at the target-donor junction precisely at the DNA distortion (red star, Fig. 3B), coordinating a magnesium ion with the scissile phosphate poised for nucleophilic attack (Fig. 3C). Mutation of the catalytic residues significantly reduced transposition activity (Fig. 3D). Surprisingly, the D205 mutation did not completely abolish transposition, but there is no immediately nearby acidic residue that can compensate for the role of D205 (the closest Asp/Glu residue is D291, which is 7.2 Å away from the Mg 2+ ion). Thus, it requires further investigation to understand how the D205A mutant can still carry out transposition.
In MuA, helix IIIα of the R1-bound subunit (light purple, indicated with a purple square, Fig. 1G) has additional roles in stabilizing target-DNA distortions and preventing reversal of the reaction (30). The absence of a similar interaction in the TnsB STC structure (Fig. 1F) suggests that the role of helix IIIα in TnsB may primarily be for tetramer stabilization rather than stabilizing target-DNA distortions. Consistent with this, in TnsB the domain IIβ close to the target DNA is closely interacting with the sugar-phosphate backbone, whereas the equivalent domain in MuA is too far to interact with the target DNA (SI Appendix, Fig.  S10). This suggests that target-DNA distortions in TnsB are stabilized via a different DNA-binding domain, namely domain IIβ.

TnsB Interactions with Donor DNA Delineates Transposon
End Recognition. Tn7-like elements have an 8-bp terminal sequence (gray, adjacent to the target-site duplication, and target site in brown, Fig. 3E) (2). In our structure, the 8-bp terminal sequence (closest to the target-site duplication and colored gray, Fig. 3E) corresponds to the part of the DNA substrate contacted by the catalytic domain (domain IIα; SI Appendix, Fig. S11) and can be assigned to the contacts between the B-L1 subunit and target DNA near the target-donor junction (Fig.  3F). Transposon cargo and Tn7/CRISPR-associated genes are flanked by left and right ends, consisting of multiple 22-bp TnsB binding sites (1, 2, 31) (blue triangles, Fig. 3E). In order to understand the protein-DNA interactions that enable TnsB to recognize its cognate DNA sequence, we looked at DNAbinding domains Iβ and Iγ which bind along the first TnsB binding site on the donor DNA (L1; Fig. 3F and G). The majority of protein-DNA interactions are sequence-nonspecific contacts with the sugar-phosphate backbone (Fig. 3F). However, several key residues located in the Iγ domain and in the Iβ-Iγ linker form sequence-specific nucleobase contacts. Within Iγ, R158 and K154 are within hydrogen-bonding distance of G À11 and G 9 , respectively (Fig. 3H). Interestingly, the Iβ-Iγ linker lies along the minor groove of the DNA duplex and contributes sequence-specific contacts. R106 and R99 are within hydrogen-bonding distance of T À14 and T 16 , respectively (Fig. 3I). The Iγ and Iβ-Iγ linker makes contact with nucleotide positions 5 to 19, which is roughly consistent with the pattern of conservation among TnsB binding sites (SI Appendix, Fig. S12). Although some base-specific interactions are observed in the Iβ domain (R58, R77, and R81), the lack of strong conservation in the TnsB donor sequence in this region (positions 20 to 30; SI Appendix, Fig. S12) suggests that these residues may not strongly contribute to transposon end recognition. Therefore, the TnsB STC structure suggests that transposon end DNA recognition may be modular (i.e., independent and separable from catalytic function) in TnsB, like MuA (32), and could feasibly be altered using rational design strategies, as has been done in the past with MuA via the generation of a chimeric recombinase called "SinMu" (33).
TnsB Forms Specific Base-Stabilizing Contacts in the Nontransferred Strand. Unlike prototypic Tn7 or other CAST elements (such as the I-F3 subfamily), ShCAST (and other V-K CAST elements) do not encode enzymatic activity for cleavage at the 5 0 ends of the element (i.e., it does not encode TnsA) (15). Consistent with this, CAST V-K elements form cointegrates indicative of replicative transposition without subsequent resolution (14). Therefore, we were particularly intrigued to discover a unique structural conformation at the 5 0 ends of the transposon (and missing in MuA) with the nontransferred strand (Fig. 4A). The linker connecting domains Iγ and IIα in the L1-bound TnsB subunit snakes underneath each 5 0 end of the element in the nontransferred strand (Fig. 4A), forming stabilizing interactions (Fig. 4B) with the first two nucleotide positions. We observe "melting" of the 5 0 end of the nontransferred strand through a flipped-out base (T 1 ; Fig. 4B). This specific conformation is stabilized by aromatic interactions with W178 and hydrogen bonding with S175 and R380 (Fig. 4B). Mutation of residues observed to interact with the nontransferred strand results in almost complete abrogation of transposition activity (Fig. 4C), highlighting the importance of the observed interactions.
We wondered whether specific interactions at the ends of the element were consistent with additional flanking DNA from the donor plasmid, as would be expected given TnsB's transposition mechanism (Fig. 1A). Modeled flanking DNA from the 5 0 end of the transposon is sterically accommodated within our existing structure (Fig. 4D), indicating that the DNA substrate we used here is consistent with formation of TnsB cointegrates. Therefore, it appears that the specific structural feature we observe at the 5 0 end of the element is both important and consistent with TnsB's expected transposition substrate. We postulate that the melting of the 5 0 nontransferred strand may serve as a regulatory step that ensures the fidelity of synaptic complex assembly.

Discussion
The structures reported here include an STC of a Tn7-like CAST element, and also highlight the remarkable consistency across the catalytic domains of RNaseH transposases, specifically with respect to the Mu transposase (22), despite distant evolutionary relationships. This high degree of structural conservation across considerable evolutionary distance leads us to propose that TnsB from prototypic Tn7 adopts an architecture similar to ShCAST TnsB and MuA upon integration. While not addressed in this work, multiple internal TnsB binding sites found asymmetrically in the left and right ends (Fig. 3E)     these elements (3,4,34). Therefore, a lingering mystery for ShCAST and related transposons is how placement of internal binding sites establishes the orientation and fidelity of synaptic complex assembly.
AlphaFold predictions of the catalytic domain (domain IIα) of prototypic Tn7 TnsB superimposes well onto ShCAST TnsB (2.4 Å rmsd; SI Appendix, Fig. S13A). Interestingly, the region known to interact with TnsA in the prototypic Tn7 system (19) localizes to where flanking host DNA would be located (SI Appendix, Fig. S13B). Given this is the position where the TnsA nuclease would need to localize in order to generate 5 0 end cuts for generating simple insertions, this structure suggests that manipulation of ShCAST transposon characteristics via structure-based engineering is practically achievable.
The structural features we observe at the 5 0 transposon end in the STC structure (Fig. 4B) have also been similarly observed in the RAG1-RAG2 synaptic complex in which a base-flipping mechanism is important for end recognition and stabilization of the heptameric RSS sequence (35). In contrast, analogous base flipping is not observed in the MuA structure (22), which is not completely modeled in this region. The absence of this feature in MuA is either a result of lack of resolution (due to anisotropic resolution) or, alternatively, that Mu does not stabilize nicked ends in an identical manner compared with ShCAST TnsB. Further research will be required to understand the exact functional role for base flipping in these elements.
The structural work described here also sheds light onto the process of transposase recruitment to the target site for ShCAST and related transposition systems. We demonstrate that physical association between TnsB CTD and TnsC is primarily via the C-terminal hook that is capable of decorating TnsC filaments (Fig. 2). A total of 50 residues (520 to 569) are not observed in either our TnsB CTD -TnsC structure nor the TnsB STC structure, and are consistent with predictions of disorder based on primary sequence ( Fig. 2A). This suggests that this particular region of TnsB remains flexible and without structure, at least in the states we have captured here. This is consistent with a model in which a second interaction between TnsB and TnsC is required to recapitulate nucleotide-dependent TnsC filament disassembly, which is observed with full-length TnsB but not the TnsB fragments that we used to decorate TnsC. Such interactions may also be needed to activate the otherwise latent transposition activity in ShCAST TnsB. While the structures here reveal mechanistic insight into TnsB function and provide a basis for ShCAST engineering, this work also uncovers exciting questions centered on ShCAST transposon structure and function that will remain fascinating topics for future investigations.

Materials and Methods
Strand-Transfer Complex Reconstitution. The strand-transfer DNA substrate was prepared by annealing three oligonucleotides, heating to 95°C, and then cooling slowly to room temperature in annealing buffer (SI Appendix for composition) (SI Appendix, Table S2). The strand-transfer DNA substrate and purified TnsB were mixed in a 1:6 molar ratio with the following final buffer composition: 26 mM Hepes (pH 7.5), 5 mM TrisÁHCl (pH 7.5), 20 mM KCl, 100 mM NaCl, 0.2 mM MgCl 2 , 15 mM MgOAc 2 , 3% glycerol, and 1.5 mM dithiothreitol (DTT). After incubation at 37°C for 40 min, the sample was concentrated to ∼7 mg/mL using an Amicon Ultra centrifugal filter (50-kDa molecular weight cutoff, EMD Millipore); 250 μL of the concentrated sample was subjected to size-exclusion chromatography (Superdex S200 Increase 10/300 GL, Cytiva). Peak fractions from 9.2 to 10.7 mL were collected for cryo-EM sample preparation (SI Appendix, Fig. S1).
TnsB CTD -TnsC Complex Preparation. TnsB and TnsC were purified following previously described protocols (4,12). Protein truncation constructs consisting of TnsB's 109 C-terminal residues (referred to throughout as TnsB CTD ) were cloned from the ShTnsB vector (Addgene, 135525) and purified using previously described protocols (4,12). To prepare the TnsB CTD -TnsC complex for cryo-EM imaging, TnsC filaments were formed by mixing purified TnsC with a 1:10 molar ratio of a 22-bp double-stranded DNA (dsDNA) substrate (SI Appendix, Table S2; see SI Appendix for more details). TnsC was allowed to polymerize on ice for 5 min before adding purified TnsB CTD at a twofold molar excess with respect to TnsC.
Cryo-EM Sample Preparation and Imaging. Slightly different sample preparation protocols were used for the two samples (referred to as TnsB STC and TnsB CTD -TnsC) described in this manuscript. For the TnsB STC, homemade graphene oxide (GO) grids were used (SI Appendix for fabrication details); 4 μL of reconstituted TnsB STC was loaded onto the carbon side of freshly fabricated GO grids. The sample was incubated on the grid for 20 s in the chamber of a Mark IV Vitrobot (ThermoFisher), which was set to 4°C and 100% humidity. Grids were blotted using a blot force of 5 and blot time of 7 s prior to being plunged into liquid ethane cooled by liquid nitrogen. For the TnsB CTD -TnsC, R1.2/1.3 gold grids (UltraAuFoil, Quantifoil) were glow-discharged (PELCO easiGlow) using a 30-mA current for 30 s prior to sample application and vitrification; 4 μL of freshly reconstituted TnsB CTD -TnsC sample was applied to the gold grid. Vitrification conditions followed that of the TnsB STC (see above).
Vitrified samples were imaged using a Talos Arctica (ThermoFisher, operated at 200 keV) equipped with a BioQuantum energy filter (Gatan) and a K3 direct electron detector (Gatan). The microscope was subjected to stringent alignment procedures, including coma-free alignment and parallel illumination (36). Highthroughput imaging was achieved using a 3-by-3 image shift in SerialEM (37). Image magnification settings corresponded to 63,000× magnification (1.33 Å per pixel scaling) and a nominal defocus range of À1.0 to À2.5 μm. Comprehensive imaging parameter details are presented in SI Appendix, Table S1. Image Processing. Warp (38) was used for micrograph preprocessing, including beam-induced motion correction, contrast transfer function (CTF) estimation, and initial particle picking. For the TnsB STC, a C1 ab-initio reconstruction was generated using cryoSPARC (39). At this point, the resulting reconstruction had apparent C2 symmetry, therefore C2 symmetry was imposed for all subsequent refinement steps. For the TnsB CTD -TnsC reconstruction, a 20-Å low pass-filtered map of the ATPγS-bound TnsC cryo-EM reconstruction (EMD-23720) (12) was used as an initial reference for cryoSPARC helical reconstruction and refinement (39). Roughly the same refinement procedure was applied to both datasets: cry-oSPARC particle alignment parameters and stacks were exported to RELION (40,41) for subsequent refinement, including three-dimensional classification, CTF refinement (42), and Bayesian polishing (43). The final TnsB STC reconstruction had an estimated resolution of 3.7 Å (gold standard Fourier shell correlation, [FSC]) and, in the case of the TnsB CTD -TnsC reconstruction, 3.5-Å resolution. More comprehensive methodological details are presented in SI Appendix.
Atomic Model Building. Different modeling procedures were used for the TnsB STC and TnsB CTD -TnsC filament cryo-EM reconstructions. For the TnsB STC cryo-EM map, the TnsB sequence was used to generate an AlphaFold2 (21) prediction. The top-ranked model was split by domain and manually docked into the cryo-EM map using UCSF Chimera (44). One half of the complex, containing two distinct conformations of TnsB and DNA, was completed manually using Coot (45) and C2 symmetry was used to generate the full complex. This was followed by manual inspection and further refinement using Coot (45). The full assembly was energy-minimized in the context of the cryo-EM map using Rosetta (46). Protein and DNA geometry was subjected to Phenix real-space refinement (47).
In the case of the TnsB CTD -TnsC filament cryo-EM reconstruction, TnsC and DNA models from the ATPγS-bound TnsC filament (Protein Data Bank [PDB] ID code 7M99) served as very close initial models. Small adjustments in the TnsC model were made using Rosetta energy minimization, employing helical symmetry to model two helical turns using a single asymmetric unit, as described previously (12). In order to identify the register of the TnsB Hook fragment, a 14-residue polyalanine backbone was first built into the density. A custom script was used to thread all 96 possible registers, representing each possible threaded sequence spanning the 109-residue TnsB CTD construct (109 À 14 + 1 = 96 possible registers), onto the TnsB fragment backbone. Each initial model was then relaxed into the density independently, using Rosetta energy minimization (46). Additional Rosetta energy terms to assess atomic model-map fit (elec_dens_fast weight = 40) were enforced during refinement (SI Appendix, Fig. S5), and 30 models were generated for each energy minimization run. The best scoring model was used to assess the sequence register, as shown in SI Appendix, Fig. S5. Details of the model statistics and validation are presented in SI Appendix, Table S1.
TnsC Filament Disassembly Assay. TnsC disassembly was probed using two different assays: an EM-based imaging assay and a biochemical assay. The imaging assay was carried out as follows: TnsC and 60-bp dsDNA (SI Appendix, Table  S2) were added at a 25:1 molar ratio into the following buffer: 2 mM nucleotide (ATP or AMPPNP), 25 mM Hepes, 200 mM NaCl, 2% glycerol, 1 mM DTT, and 2 mM MgCl 2 in order to initiate filament assembly. Filaments were then either incubated with full-length TnsB or TnsB truncations (1:1 molar ratio of TnsC to TnsB) or an equivalent volume of the buffer as a negative control. Each reaction was incubated at 30°C for 1 h, followed by negative-stain EM. For the biochemical assay, desthiobiotinylated DNA was incubated with streptavidin magnetic beads. TnsC filament assembly was initiated (as described above) and then TnsB (full-length or truncation constructs) was added to the reaction mixture. After multiple washes, DNA was eluted from the beads using a 4 mM biotin solution and the associated proteins were examined using sodium dodecyl sulfate-polyacrylamide gel electrophoresis. SI Appendix includes more comprehensive details.
Data Availability. Electron density maps and atomic models reported in this article have been deposited in the Protein Data Bank and Electron Microscopy Data Bank (EMDB) (EMDB ID code 25454 TnsB CTD -TnsC filament, EMDB ID code 25455 TnsB STC, EMDB ID code 27140 TnsB Hook -TnsC filament, PDB ID code 7SVV TnsB CTD -TnsC filament, and PDB ID code 7SVW TnsB STC).
All study data are included in the article and/or supporting information.